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THE PHYLOGENY OF TRIBE ZINGIBEREAE 
(ZINGIBERACEAE) BASED ON ITS (nrDNA) AND 
trnL-F (cpDNA) SEQUENCES 


С. NGAMRIABSAKUL*, М. Е. NEWMANT & О. С. В. Скомктї 


A phylogenetic analysis of the tribe Zingibereae (Zingiberaceae) was performed using 
nuclear ribosomal DNA (ITS1, 5.85 and ITS2) and chloroplast DNA (trnL (UAA) 5’ 
exon to trnF (GAA)). The tribe is monophyletic with two major clades, the Curcuma 
clade and the Hedychium clade. Paracautleya, sampled for the first time, comes out as 
predicted while Caulokaempferia comes out in a different position from that found in 
another recent study. The genera Boesenbergia and Curcuma are apparently not 
monophyletic. 


Keywords. Molecular systematics, phylogeny, Zingiberaceae, Zingibereae. 


INTRODUCTION 


The tropical monocotyledonous family Zingiberaceae is highly natural, containing 
some 53 genera mostly occurring from India to New Guinea. Until recently the 
genera were grouped into three or four tribes, mainly according to the nature of 
the lateral staminodes. Four subfamilies are currently recognized, two of them 
monogeneric, the other two with two tribes each (Kress et al., 2002). 

The tribe Zingibereae contains 25 genera, Caulokaempferia K. Larsen being 
unplaced. All genera so far examined have the plane of leaf distichy parallel to the 
direction of rhizome growth, and most have large, petaloid lateral staminodes. The 
ovary is trilocular with axile placentation, or is clearly derived from this condition, 
and chromosome numbers mostly vary according to the genus. 

Kress et al. (2002) were able to sample widely across the Zingiberaceae, providing 
sequences of the ITS and matK (chloroplast DNA) genes for 104 species in 41 
genera. However, they were unable to obtain sequences of 12 genera. 

Previous studies in the Zingibereae at the Royal Botanic Garden Edinburgh 
(RBGE) have focused on the genus Roscoea Sm. (Ngamriabsakul et al., 2000). Неге 
we build on that study, examining the genera of the Zingibereae in detail. We have 
obtained material of Paracautleya R.M. Sm., which was among the genera not avail- 
able to Kress et al. (2002), and have used the same nuclear gene as Kress et al. (2002) 
but a different chloroplast gene, trnL-F. 
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MATERIALS AND METHODS 
Plant material 


Many of the samples were taken from the research collections at RBGE. Another 
important source was a field trip in Thailand during July and August 1999 (C.N. 
and M.F.N.). Other Zingiberaceae researchers have also provided samples. Table 1 
gives details of the plants sequenced, as well as authorities for all species. 


Outgroup selection 


Early phylogenetic studies within the family (Soltis et al., 2000; Wilf et al., 2000; 
Wood et al., 2000) appear to show unambiguously that the tribe Alpinieae was 
the basal branch in the family, followed by Globbeae, Hedychieae and Zingibereae. 
Our study was conducted before Kress et al. (2002) discovered that Siphonochilus 
J.M. Wood & Franks was basal so we chose three species in the Alpinieae, Alpinia 
galanga, Renealmia battenbergiana and Pleuranthodium schlechteri, as the outgroup 
because living plants were available at RBGE and had been used in a previous study 
(Rangsiruji et al., 20005). This choice does not affect the results adversely since these 
three species are still clearly outside the clade being studied. 


Ingroup selection 


For the ingroup taxa, 34 species from 16 genera were sequenced (see Table 1). The 
species were chosen to represent variation within each genus, in terms of both total 
number of species and distribution. At least 10% of the species in each genus were 
included to avoid any excess heterogeneity in rates of molecular evolution that may 
be found. Table 1 shows the genus names, the numbers of species so far described 
and the number included in this study. 

ITS sequences of some missing genera and additional species of Zingibereae were 
obtained from GenBank: 


Haniffia cyanescens (Ridl.) Holttum (AF202407); two species in genus 

e Hitchenia glauca Wall. (AF202413); three species in genus 

Pommereschea lackneri Wittm. (AF202405); two species in genus 

Rhynchanthus beesianus W.W. Sm. (AF202415; Wood et al., 2000); seven species 
in genus 

Camptandra ovata Ridl. (AJ388302); three species in genus 

Boesenbergia cordata R.M. Sm. (AJ388277; Searle & Hedderson, 2000); c.50 
species in genus. 


The remaining five genera in Zingibereae, as circumscribed by Kress et al. (2002), 
which could not be included owing to lack of material, are Haplochorema K. Schum. 
(three or four species), Laosanthus K. Larsen & Jenjitt. (monotypic), Nanochilus K. 
Schum. (monotypic), Parakaempferia A.S. Rao & D.M. Verma (monotypic), and 
Stadiochilus R.M. Sm (monotypic). 


485 


PHYLOGENY OF TRIBE ZINGIBEREAE (ZINGIBERACEAE) 





ис 
ис 
1/01 


ат 


5/06 


Ариз$ sty} ш рорпјош 
sərvəds Jo том 
/sarseds Jo лодшти јејој 


E8LVcvAV 


C8LVCVAV 


І8/ӮСРЛУ 


08/7С7АУ 


6//7СРЛУ 


8/7СРАУ 


LLLYCVAV 


OLLVCVAV 


СИЛУ 


4-10 


OSLAV 
6бУ/7РАУ 
ЗИРСТАМ 
LvyLycvAV 
OPLVCVAV 
СИТАУ 
ТИТСЛУ 
evlLycvAV 


СПРАМ 


IpLycvAV 
0ғ/ ?СУАУ 


бЕ/7СРЛУ 


TSLI рие 6857181 


Joquinu 001689020 уивяиәгу 


(D сє ижзарыит8 N 

:<9116661 ОЯМ ‘purpreyL 
(Я) 05 Іпурвфомшоӛұ 

:09106561 ЧОЯЯ 


(4 A8) 19 
1пурѕдрашр8у ‘разиечл, 


— їшцуелд] “алеје 

(9 “1ЯЯ) %6 
иршмә :рчепечт, 

(9 “1ЯЯ) роб 
иршмәу ‘ризиечТ, 

(я '4 28) 506 
иршмәу ‘ризиечт, 

(D 97 /пурѕ411шр8 ү 
129915861 4084 

(D 6с 1psqouv3 N 
:780<861 8084 


(Я) 28780 (Я) LZ 
inussuvy ‘у 0100161 ЯОЯЯ 

(Я) $ 4400 2 
:89198$/ IVM 

(а) с иплевиру ‘у 
71011461 ЧОЯЯ 


лоцопол рие лодшти 901559998 “21106 


пәвіРТ "У % роој 
рјопгфвиој 
тагјашорупилођ 76 
Joyeg CWS) 
2122145 рбәрпоә ‘8 
чподиј, 29 usq ‘У 
ргорјола тагјашгруојпро 71, 
"ТРЕЯ (лозе) 
рјпалра рарипашро 79 


240]д8ио] "де g "$ 

ап (ел) 
раојлвиој 74 Ср 

тесіле Б 
00и110]28 4 << 

BSSNILNG хә моле] У 
21221451509 J "С 

WS WA 
DIDYUDAND ра8лодигк2од `I 
пои] 


Joyeg хә зип) 
ритзлодиг та рішүрәиәу 
Ug ИГ Сшпуоб У) 

149729125 штроцјирлпаја 


PIEM CT) гзивјов рші 
41018310 


чохеј, 


влодшти 901559598 YURGUID рив (оја веле у) лоцопол ‘JaquINU иотѕѕәоов ‘nos уум Ариз$ sty} ш psouonbes хеј “| ATAVL 


С. NGAMRIABSAKUL ET AL. 


486 





ІЛ 


v/0v 


5/06 


ис 


9/05 


Ариз$ $03 ш рорпјош 
вәтоәй Jo IoquinN 
/sarsads Jo лодшти јејој 


COLVCVAV 
16/1 7СУЛУ 
06/7С7АУ 


68/7РАУ 
88/7<7АУ 


18477 АҮ 


98/7С7АУ 


68/7С7АУ 


РАСТА 
4 ли 


99/7С7АУ 
СОРСТАМ 
РОРСТАМ 
СОУРСТАМ 
СИРСФТАМ 
19/7С7АУ 
09/7С<7АУ 


65[7РАУ 
851 7РАУ 


LSLYCVAV 
9С/7С7АУ 


SSLYCVAV 
УО/7СЛАУ 
СИРСРТАМ 
CSLYCVAV 


ISLpcvpAV 
6811 рие 585 ‘ISLI 


Joquinu 901559098 Хавяиәг) 


(9) бре “HOM зеца “вірш 
(9) 8с /турѕфтлшр8 ү 

:81906561 ЧОЯЯ 

- 19009861 TOTA 

(Я “<13Я) 6/8 иршиам ‘ризпечл, 


= 16910961 ЧОЯЯ 
(4728) 916 иршиам ‘PULL 
— 76710661 ЧОЯЯ 
- 18909961 8084 
(9) LZ Іпурвдиышоду 
00101661 OQA 
= ‘90815461 ЧОЯЧЯ 
(4) pz /пурѕдрлшр8 ү 
:<9710661 ЧОЯЯ 
— 0580111 гривреуј, 
(Я 429) сє 
1пурѕдрлшр8у ‘разиечт, 
(я “AM) 8r 
1пурѕдртлшр8у ‘разиечл, 
(Я “1Я8) 86 
І|пүпвармшт5 М ‘разиечт, 
(9) LZ иоарау W 
‘10001861 OQA 


(Я “43Я8) %6 
иршмәр :риепечтТ, 


JOYONOA рие лодшти 01559998 “оолпоб 


WS шпәші202 штуодргн 


ТЕТЕ 
туд рАгртрорара 


"1 рритол У 

Коле рилазари `У 

"ем хирбо|ә М 
90280 

pyofisnsuv тагјашгрој 

ем шпора Н 

ШН МУ 20100004 х Н 


90950 шпитагирлрв `H 


иешљмәм “A 
224112 SAUIDJYIOYINSIG 
"хо зиговадпа `J 


"ем 240] Дара 5 
‘dousey приримру `p ` 
91810) рјошоәә `) ` 
хоч орршр `p · 


douge p 
руојпршхјр DUNIAND ` 


‘9C 


“ST 
‘VC 
“с 


CC 
‘IC 
`0 
'61 


781 
‘LI 


791 
“SI 


ҮТ 


ЄТ 


СІ 


П 


01 


uoxR 


(2.109) СІ ATAV L$ 


487 


PHYLOGENY OF TRIBE ZINGIBEREAE (ZINGIBERACEAE) 





1188 


1/9 


ІП 


а0с 


ав1 


ІП 


Apis sty} ш рорпрш 
$919э4$ Jo одштм 
[зэтоэ@$ Jo лодшти oL 


ООВРСРТАМ 


66/ӮСРЛУ 


86/7СРЛУ 


LOLVCVAV 


ОСАМ 


S6LVCVAV 


voLVCVAV 


COLVCVAV 


4“ ли. 


PLLYCVAV 


eLlLycvAV 


CLLVCVAV 
ILLVCvAV 
OLLVCVAV 
69LVCVAV 


89/7<7АУ 


1947 7 АҮ 
TSLI pure 5857150 


лофшпи UOIssadoR ХиеЯиәг) 


(Я “1ЯЯ) #56 иршмам 
69116661 ОЯМ ‘реет. 
(A) pe Іпур84омшр8 

10218661 ADGA 


(а) 10118000 
DUDIDAIJUDSYOOSIDE `X 


‘вивуелниезуоозтеа ‘ризеч, 


— {175861 ЗОЯЯ 

(8) сс /пурѕ9рышр8у 
‘TETEPIGL TOTA 

(9) 8 jnyosqninuns Ny 
:0191/861 8084 

(а) cz jnyosquisus N 
“9117861 ЧОЯЯ 


(А) се jnyosqniaunsN 
51510661 ЯОЯЯ 


тәцопол рив Joquinu 001559996 “золпоб 


‘dousey шпазинг л2д18ш7 


Ws “АГ (19508) 
SNIDAIN] ода ul 5111 их 


молер 74 29 SSNY ‘ГМ 
әрирәирлапѕ SIDUS 


шпон 
(Трга) 220/0әәиру 5 
шпон aeg) 
мојвипу $шрцәоцӣрэѕЅ 
WS АҒА 5 FILA 
рирәшт М 
"Чет м 
роиртуд 0202504 
uo AZ 9 
пм "ТТ. (92985) 
sisuaupuund штј удова СТ 


VE 


“CE 


‘CE 


‘TE 


"ДЕ 


‘6с 


8с 


‘LT 


похед, 


(рлиоо) СТ 81Я4У1 


488 С. NGAMRIABSAKUL ET AL. 





Total genomic DNA extraction 


The CTAB method (Doyle & Doyle, 1990) was used to obtain total DNA of plant 
cells. Fresh leaf samples were taken and kept in dry silica gel before DNA extrac- 
tion. The modified protocol for DNA extraction followed our previous study 
(Ngamriabsakul et al., 2000). The QIAgen Dneasy kit (QIAGEN, 1997) with liquid 
nitrogen was also used, with few modifications, to give high quality total DNA. 
Incubation times were increased to 30 and 10 minutes, instead of 10 and 5 minutes, 
in steps three and four of the protocol, respectively. 


PCR amplification and DNA sequencing 


Each PCR reaction was 50Ш in volume. The PCR reaction mix was prepared before 
aliquoting to each tube and adding template DNA as the last component. The 
components and the conditions of the PCR followed Ngamriabsakul et а!. (2000), but 
with primer volume decreased to 2ul instead of 5ш. No significant reduction in 
products was detected. The ІТ5І, 5.85 and ITS2 complete region was amplified 
by using primers ‘5P’ and ‘8P’ (Moller & Cronk, 1997). ITS1 and ITS2 had to be 
amplified separately for some species. Primer ‘5P’ and primer ‘2K’ (Rangsiruji, 1999) 
were then used to amplify ITS1, while primer ‘3P’ and primer ‘8P’ were used for ITS2. 

PCR amplification of trnL—F with primers ‘c’ and ‘f’ (Taberlet et al., 1991) was 
found to show more than one distinct band in some species, e.g. two when using the 
conditions described for ITS (Ngamriabsakul et al., 2000). Various conditions for 
the PCR reaction were then tried. It was found that when using primers “с” and ‘f’ to 
amplify some Zingiberaceae DNA, optimal conditions were needed, e.g. a smaller 
volume of primer and a well-calibrated thermocycler. In cases where a second band 
could still be observed after amplification, ће trnL—F region was then further ampli- 
fied using two sets of primers. Primers ‘c’ and ‘4’, along with ‘e’ and ‘f’, were used to 
separate amplifications of trnL intron and trnL—F spacers, respectively. АП the 
products of primers ‘c’ and ‘f’ (a complete region of trnL intron and trnL-F spacer), 
“с” and “а” (trnL intron), and ‘e’ and ‘f’ (trnL—F spacer) were successfully obtained 
as a single band. PCR products were purified before automated cycle sequencing 
using а QIAquick™ PCR purification kit. Forward and reverse sequencings, using 
the same primers as for PCR reactions, were performed for sequence confirmation 
as described in Ngamriabsakul ег а/. (2000). The primer sequences used in this study 
аге (5' to 3), 5P=GGA AGG AGA AGT CGT AAC ААС С, 8P=CAC ССТ 
TCT CCA GAC TAC A, 2K = ССС ACA АСТ TGC GTT САА AG, 3P =GCA 
TCG ATG AAG AAC GTA GC, c=CGA AAT CGG TAG ACG CTA CG, 
d=GGG GAT AGA GGG ACT TGA AC, e=GGT TCA AGT CCC TCT ATC 
CC, f=ATT TGA ACT GGT GAC ACG AG. 


Sequence analysis 


All sequences were verified by comparison of their forward and reverse sequences 
in Autoassembler™ (Applied Biosystems Division). Sequence boundaries of the 
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range of ITS1, 5.85 and ITS2 in all taxa were determined by comparison with publi- 
shed sequence data for Roscoea (Ngamriabsakul et al., 2000) and Alpinia Roxb. 
(Rangsiruji et al., 2000a). All sequences are deposited in GenBank (see Table 1). 

The sequences were aligned using CLUSTAL_X (Thompson et al., 1997; Hickson 
et al., 2000) with default values (e.g. gap-opening cost = 15) and manual adjustment 
in only the first alignment. Because of the high similarity in length and nucleotides 
of the trnL—F sequences (see Table 2), a sensitivity test of alignment was performed 
only for the ITS data set by varying the gap-opening cost to 5, 10, 20 and 25 to 
yield four other different alignments (Jeanmougin et al., 1998). The alignments were 
directly submitted to parsimony analysis. This was to determine the effects of align- 
ment and gaps in the ITS data set on the resulting phylogenetic estimates. Character 
congruence is advocated as both an internal criterion (Bogler & Simpson, 1996) and 
an external criterion (Giribet & Wheeler, 1999) for choosing the best alignment 
based on parsimony. Thus we chose a rescaled consistency (RC) index for each 
analysis (Bogler & Simpson, 1996) and P-values of the homogeneity test for each of 
the differently aligned ITS data sets and the trnL—F data set as indicators of optimal 
alignment. 

А transition/transversion ratio was determined using MacClade version 3.0 
(Maddison & Maddison, 1992) on one of the most parsimonious trees from the 
unweighted initial analysis. The G+C content and sequence divergence among taxa 


TABLE 2. Sequence characteristics of nuclear ribosomal DNA (ITS1, 5.85, ITS2) and 
chloroplast DNA (trnL-F) 


Parameter ІТЅІ, 5.85, ITS2 trnL—-F 
Length range (total) (bp) 573-672 894-960 
Length mean (total) (bp) 591.24 913.04 
Length range (ingroup) (bp) 577-672 894-960 
Length mean (ingroup) (bp) 592.00 913.52 
Length range (outgroup) (bp) 573-591 906-914 
Length mean (outgroup) (bp) 582.00 909.33 
Aligned length (bp) 722 (662)* 1008 
G+C content range (%) 52.30-59.82 31.35-33.41 
а + С content mean (%) 55.71 32.78 
Sequence divergence (ingroup) (%) 0.00–23.89 0.11–2.50 
Sequence divergence (in/outgroup) (%) 9.98—23.75 1.79-3.88 
Number of variable sites (% in parentheses) 342 (51.66)* 123 (12.20) 
Number of constant sites (% in parentheses) 320 (48.34)* 885 (87.80) 
Number of informative site (% in parentheses) 213 (32.17)* 38 (3.77) 
Number of autapomorphic sites (% in parentheses) 129 (19.49)* 85 (8.43) 
Transitions (unambiguous) 483 30 
Transversions (unambiguous) 258 38 
Transitions/transversions (ts/tv) 1.87 0.79 
Average number of steps per character 1.414 0.149 


*662 bp is the length of the ITS data set used in the analyses. 
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were calculated using ‘Base frequencies’ and ‘Show pairwise distance’ options in 
PAUP* Version 4.0b4 (Swofford, 1998). 


Phylogenetic analysis 


Phylogenetic trees were generated using PAUP* Version 4.0b4 (Swofford, 1998), 
with character states unordered and at first equally weighted. The heuristic search 
was set to 1000 replicates with random addition sequence and TBR (tree bisection— 
reconnection) branch swapping. Polymorphic characters were treated as uncertain. 
Gaps were treated as missing values. ‘Mulpars’ and ‘Steepest descent’ were the 
search options selected. ‘Acctran’ (accelerated transformation) was chosen for 
character optimization. 

A partition homogeneity test (Farris et al., 1994), also known as the incongruence 
length difference (ILD) test, was carried out in PAUP* with the heuristic search set 
to 1000 replicates and 10 replicates of random addition sequence, TBR and 
‘Mulpars’, to test the hypothesis that the two data sets, ITS and trnL-F, contain the 
same phylogenetic information. The ITS data set was reduced to 26 taxa to match 
the 26 taxa trnL-F data set for the test. The results suggest that both data sets are 
congruent (P-value >0.05) and can be combined. A combined analysis of both data 
sets was performed using the same phylogenetic methods and parameters as above. 

Successive weighting searches were performed using a rescaled consistency 
(RC) index (mean value; Swofford, 1993) until the resulting tree length remained 
unchanged in two consecutive rounds. Due to the high value of transitions found 
in the ITS data matrix (65%), the transition/transversion ratio (ts/tv=1/1.87) 
was applied to a parsimony analysis of the data set to weight transversions over 
transitions. 

Support for individual clades was given a bootstrap value (Felsenstein, 1985) 
and a decay index (Bremer, 1988; Donoghue et al., 1992). Bootstrap analysis was 
performed using PAUP%* set to heuristic search with 1000 replicates, TBR and 10 
random addition sequence replicates per heuristic search. In the results and discus- 
sion presented here, clades with bootstrap (BS) values of 50-74% represent weak 
support, 75-84% moderate support and 85-100% strong support (Richardson et al., 
2000). The decay index (DI) was calculated using Autodecay version 4.0 (Eriksson, 
1998) with 10 random addition sequence replicates per heuristic search. 

Maximum likelihood analysis was performed for the ITS data set in PAUP* by 
applying the best fit likelihood model, TrN+G, resulting from the likelihood ratio 
test (Huelsenbeck & Rannala, 1997) using MODELTEST version 3.0 (Posada & 
Crandall, 1998). The substitution model used allows unequal base frequencies, 
unequal transition and transversion rates, and among-site rate heterogeneity. 


RESULTS 


The ITS sequences of 36 species were obtained in this study and sequences of 
six other ingroup taxa were taken from GenBank. The sequences of Kaempferia 
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marginata were unreadable due to sequence polymorphism. In total, there are 42 
taxa in the ITS data matrix and 26 taxa in the trnL-F data matrix. The trnL-F data 
matrix is smaller than the ITS data matrix for two reasons. Firstly, the taxa in 
GenBank have not been sequenced for trnL—F (or the sequences are not yet avail- 
able) and secondly, the trnL—F region in some of our own DNA samples proved 
difficult to amplify and sequence. 


Best alignment of the ITS data set 


The alignment of the ITS data set with default values (i.e. gap-opening cost = 15) 
in CLUSTAL_X gave the highest RC value when the data set was analysed to find 
the most parsimonious trees. Four other values of gap-opening in CLUSTAL_X, 
namely 5, 10, 20 and 25, gave alignments different from the default value. RC values 
of these different alignments by parsimony analysis were lower than those of the 
first alignment without manual adjustment (data not shown). The default value 
alignment that gave the highest RC value was further improved by manual adjust- 
ment and when analysed, the resulting RC was slightly higher than the alignment 
without manual adjustment (data not shown). 

The P-value of the initial homogeneity test of both data sets suggests that the 
phylogenetic signals contained in the data sets are homogeneous and can be com- 
bined (P-value>0.05). It is assumed that the alignment of the ITS data set that 
yields the highest P-value when used in the homogeneity test represents the best 
alignment. The assumption is that the data sets are parts of one big data set of all 
taxa and that any partition of it will lead to the same phylogenetic estimate. The 
P-value for the homogeneity test of the trnL—F data set and the first alignment of 
the ITS data set, with default value and manual adjustment, was higher than values 
derived from other alignments of the ITS data set (data not shown). This is thus the 
best alignment of the ITS data set found and is that used in the rest of this study. 


Sequence analysis of the ITS region 


Alignment of the ITS sequences of the 42 taxa analysed resulted in a 722-bp long 
data matrix. As 60 bp were excluded because of alignment ambiguities, a data 
matrix 662 bp long was subject to analyses. Its characteristics are given in Table 2. 
Two sequences taken from GenBank, Boesenbergia cordata and Camptandra ovata, 
lacked the first 23 and 25 bp of 1151, respectively. Scaphochlamys kunstleri and 5. 
lanceolata lacked most of their 5.8S sequences. 

The lengths of the complete ITS sequences were 573—672 bp. Of these aligned 
sites, 320 (48.34%) were constant, 213 (32.17%) had at least two nucleotide states in 
two or more sequences and were potentially phylogenetically informative, and 129 
(19.49%) were autapomorphies (Table 2). 

The sequence divergence of 1181, 5.85 апа ITS2 among ingroup species ranged 
from 0.0 to 23.9% whereas sequence divergence between ingroup and outgroup 
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ranged from 10.0 to 23.8%. The maximum sequence variation among ingroup 
species was 23.9%, between Kaempferia angustifolia and Scaphochlamys lanceolata. 
The maximum sequence variation between ingroup and outgroup was 23.8%, 
between Alpinia galanga and Scaphochlamys lanceolata. Apart from the identical 
ITS sequences of Hedychium coccineum, H. gardnerianum and Н. x raffillii, the 
lowest sequence variation among ingroup species was 0.09%, between Curcuma 
alismatifolia and C. parviflora. 

The sequence of Kaempferia elegans is the longest in this study (672 bp), and 
the highest ITS variation within a genus is 17.93%, between K. angustifolia and 
К. elegans. The maximum levels of variation of ITS within other genera are 15.37% 
(Scaphochlamys kunstleri and S. lanceolata), 11.83% (Boesenbergia cordata and 
В. gelatinosa), 7.06% (Camptandra ovata and С. parvula), 6.96% (Curcuma alis- 
matifolia and C. ecomata), 2.75% (Roscoea bhutanica and R. humeana), and 1.88% 
(Hedychium coccineum and H. sp.). 


Sequence analysis of the trnL-F region 


Alignment of trnL—F sequences of the 26 taxa analysed resulted іп a data matrix 
1008 bp long (Table 2). Ranges of the sequence at primer sites ‘d’ and “е” of three 
taxa, Cornukaempferia longipetiolata, Hedychium sp. and Kaempferia rotunda, 
lacked 25, 66 and 32 bp, respectively. The sequence of Distichochlamys citrea lacked 
the last 126 bp. 

The length of the complete trnL-F was 894-960 bp. Of these aligned sites, 885 
(87.80%) were constant, 38 (3.77%) had at least two nucleotide states in two or more 
sequences and were potentially phylogenetically informative, and 85 (8.43%) were 
autapomorphies (Table 2). 

The sequence divergence of the trnL-F intron and the trnL-F spacer among 
ingroup species ranged from 0.1 to 2.5%, whereas the sequence divergence between 
ingroup and outgroup ranged from 1.8 to 3.9%. The maximum sequence variation 
among ingroup species was 2.5% between Kaempferia angustifolia and Pyrgophyllum 
yunnanensis. The maximum sequence variation between ingroup and outgroup was 
3.9%, between Renealmia battenbergiana and Curcuma alismatifolia. The lowest 
sequence variation among ingroup species was 0.1%, between Boesenbergia auran- 
tiaca and Caulokaempferia violacea. However, when comparing the sequences of 
these two taxa, two indels are present, | and 7 bp long. 


Phylogenetic analysis of the ITS region 


Twenty-three most parsimonious trees from two islands, size 2 and 21, were 
obtained from parsimony analysis of the ITSI, 5.85 and ITS2 data sets from 
42 taxa, with length of 936, consistency index (CI)=0.5417, retention index 
(RI) =0.6374 and RC=0.3452. The strict consensus tree of the 23 most parsimo- 
nious trees is given in Fig. 1 with bootstrap values and decay indices. The average 
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Caulokaempferia violacea 
Boesenbergia basispicata 
Boesenbergia gelatinosa 
Boesenbergia longiflora 
Boesenbergia aff. longiflora 
Cornukaempferia longipetiolata 
Zingiber junceum 
Distichochlamys citrea 
Hedychium coccineum 
Hedychium gardnerianum 
Hedychium x raffillii 
Hedychium villosum 
Hedychium sp. 
Kaempferia angustifolia 
Kaempferia elegans 
Каетртепа rotunda 
Scaphochlamys kunstleri 
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Rhynchanthus beesianus 
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Camptandra parvula 
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Curcuma parviflora 
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Stahlianthus involucratus 
Curcuma amada 

Curcuma rubescens 
Hitchenia glauca 
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Curcuma ecomata 
Smithatris supraneanae 
Pyrgophyllum yunnanensis 


FIG. 1. Strict consensus tree of the 23 most parsimonious trees resulting from analysis of 
ITS data for 42 taxa. Upper numbers are bootstrap values of 1000 replicates. Lower numbers 


are decay indices (CI=0.542, RI=0.637, RC =0.345). 


number of nucleotide substitutions per character was high: 1.414 compared with 


0.149 for the trnL—F data set. 


The strict consensus tree accords with that of Kress et al. (2002). Three major 
clades are recognized, namely the Curcuma clade, the Camptandra clade and the 
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Hedychium clade. Pyrgophyllum resolves as sister to these clades, the relationships of 
which are unresolved. Only the Curcuma clade and the Camptandra clade are 
strongly supported. Resolution within the Curcuma clade is rather high (most 
subclades having BS >84 апа DI=3-13) showing that Curcuma L. is paraphyletic. 
The Curcuma clade also contains four morphologically very similar genera, 
Hitchenia Wall., Smithatris W.J. Kress & K. Larsen, Stahlianthus Kuntze and 
Paracautleya, the last of which was not sampled by Kress et al. (2002). Curcuma 
ecomata and Smithatris supraneanae form a subclade separate from the rest, 
but with weak support. Stahlianthus is found to be sister to Curcuma subgen. 
Hitcheniopsis and the HitchenialParacautleya clade is sister to Curcuma subgen. 
Curcuma. 

Although relationships within the Hedychium clade are not resolved, there are 
some well-supported clades. Scaphochlamys Baker species are grouped as a clade 
with strong support and Cautleya spicata is sister to Roscoea. Hedychium J. König 
species are grouped as a strongly supported clade while Kaempferia species are 
grouped as a clade with weak support. Our strict consensus tree shows one conflict 
with that of Kress et al. (2002). Where they found Caulokaempferia К. Larsen 
formed a clade, sister to all other members of the Zingibereae, we find it to be sister 
to Boesenbergia aurantiaca and B. cordata. As in Kress et al. (2002), the other 
Boesenbergia taxa form two clades, though as there are no species in common 
between the two studies it is not possible to say whether our data support their 
Boesenbergia І and Boesenbergia II clades. The other four species, В. basispicata, 
B. gelatinosa, B. longiflora and B. aff. longiflora, form a weakly supported clade, as 
do Pommereschea and Rhynchanthus. 

Successive weighting analyses produced a single most parsimonious tree (Fig. 3). 
However, this is not one of the 23 shortest trees resulting from an unweighted analy- 
sis. Figure 2 shows one of the 23 most parsimonious trees. Besides the clearer pat- 
tern of relationships, the positions of Pyrgophyllum (Gagnep.) T.L. Wu & Z.Y. Chen 
and Camptandra Ridl. in the successive weighting tree and in the strict consensus 
tree from an unweighted analysis are the most significant differences. Because the 
successive weighting analysis fails to replicate any of the most parsimonious trees 
from an unweighted analysis, we consider the result unreliable and do not discuss it 
further. 

The weighting of transversion over transition by an observed ratio (2/1) of 
the data set produced 14 most parsimonious trees (CI=0.5620, RI=0.6342, 
RC=0.3564). The strict consensus tree of these 14 is nearly identical to the strict 
consensus tree of an unweighted analysis, but with higher resolution, particularly 
within the Hedychium clade (Fig. 2). 

Maximum likelihood analysis recovered two optimal trees (In-likelihood 
=5551.712). One of the two optimal trees is presented іп Fig. 4. Two main subclades, 
as found in the strict consensus tree of ts/tv-applied search, can be recognized, namely 
the Hedychium clade and the Curcuma clade. Pyrgophyllum is basal in the Curcuma 
clade. The Pyrgophyllum/Camptandra clade is sister to the Curcuma complex. 
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Curcuma rubescens 
Hitchenia glauca 
Paracautleya bhatii 
Curcuma ecomata 
Smithatris supraneanae 


Pyrgophyllum yunnanensis 


10 changes 
Fic. 2. One of 23 most parsimonious trees resulting from unweighted analysis of ITS data 
for 42 taxa. Arrows denote collapsed branch in the strict consensus tree of the 14 most 


parsimonious trees resulting from the transition/transversion ratio applied analysis. 
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Fic. 3. The single most parsimonious tree resulting from successive weighting searches of 
ITS data for 42 taxa using a rescaled consistency index. Note that this tree is not one of the 
23 most parsimonious trees from an unweighted search. 


Topologies of this complex are identical to those found in the strict consensus tree of 
the ts/tv-applied search. Within the Hedychium clade, the subclade Cautleyal Roscoeal 
PommereschealRhynchanthus is sister to Hedychium. In turn, this CautleyalRoscoeal 
Pommerescheal Rhynchanthus!Hedychium clade is sister to the Boesenbergia group. 
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Fic. 4. The strict consensus tree of two equally optimal trees resulting from maximum 
likelihood analysis of ITS data for 42 taxa (In-likelihood = 5551.712). 


Haniffia may be sister to the remaining taxa. One difference in the topologies 
found in the maximum likelihood tree and the ts/tv-applied tree is the swapping 
of the Distichochlamys!Scaphochlamys clade and the Kaempferia clade, while the 
Boesenbergia clade, with Caulokaempferia nested in it, is the last branch and is 
identical in both trees. 


498 С. NGAMRIABSAKUL ET AL. 





Phylogenetic analysis of the trnL—F region 


Five most parsimonious trees on one island were obtained from parsimony analysis 
of the 26 taxa trnL—F complete region data set, with a length of 150, СТ=0.9067, 
RI=0.7879 and RC=0.7143. Successive weighting analyses produced the same set 
of trees as found in the unweighted analysis. The majority consensus tree of the five 
most parsimonious trees is given in Fig. 5 with bootstrap values and decay indices. 

Although there is less resolution in the consensus tree compared to that of the 
ITS data set, the trnL-F tree gives some phylogenetic information. The clades with 
moderate to strong support are those supported in the matK analysis of Kress et al. 
(2002). This is what would be expected from two genes in the chloroplast genome. 

The analysis of trnL-F gives moderate support to the hypothesis that Zingiber 
belongs in the Hedychieae (sensu Schumann). It also confirms that Caulokaempferia 
is derived within Boesenbergia. An obscure relationship, not found in the strict 
ITS consensus tree, was revealed when Camptandra parvula and Pyrgophyllum 
yunnanensis were grouped together, though with weak support. This relationship 
does not appear in the matK analysis of Kress et al. (2002). The same Curcuma com- 
plex as found in the strict ITS consensus tree (Curcuma, Paracautleya, Smithatris 
and Stahlianthus) was again revealed by the trnL-F data set, though only with mod- 
erate support. Curcuma (subgen. Hitcheniopsis), Smithatris and Stahlianthus were 
further, though weakly, supported as a clade. Hedychium appears as sister group to 
the Curcuma complex according to the trnL-F data set, yet with weak support. The 
species within Kaempferia, Roscoea and Scaphochlamys were each grouped together 
with weak to moderate support (BS=69, DI=2 in Kaempferia, BS=80, DI=3 in 
Roscoea and BS=51, DI=1 in Scaphochlamys). 


Phylogenetic analysis of the combined data sets 


The P-value, 0.734, resulting from the partition homogeneity test of both data sets 
indicates that there is considerable congruence in the phylogenetic information from 
the ITS and trnL-F data sets. The data sets were thus combined for a simultaneous 
parsimony analysis. Two most parsimonious trees from one island were obtained, 
with length of 882, CI=0.6406, RI=0.5681 апа RC=0.3639. The strict consensus 
tree is shown in Fig. 6. The tree recognizes the monophyly of the Zingibereae 
with strong support. Three major clades were identified in the Zingibereae, Cautleyal 
Roscoea, the Curcuma complex, and the Hedychium clade. However, there is no 
strong support for the relationships among these clades. Cautleya (Benth.) Hook.f. 
is identified as sister group to Roscoea. CamptandralPyrgophyllum appears as 
sister to the Curcuma complex, though the bootstrap value is less than 50%. Within 
the Curcuma complex, Smithatris is moderately supported as sister to the rest of the 
complex. Stahlianthus is grouped with Curcuma subgen. Hitcheniopsis, and 
Paracautleya is grouped with Curcuma subgen. Curcuma. 

The Hedychium clade is weakly supported and shows Hedychium as sister to the 
rest of the clade. The monophyly of Hedychium is strongly supported (BS = 100, 
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Fic. 5. The majority consensus tree of five most parsimonious trees resulting from analysis 
of trnL-F data for 26 taxa. Upper numbers are bootstrap values of 1000 replicates. Lower 
numbers are decay indices (CI=0.907, RI=0.788, RC=0.714). Arrows denote collapsed 
branch in the strict consensus tree. 


DI=20) as 15 that of Kaempferia (BS=91, DI=7) and Scaphochlamys (BS =98, 
DI=9), and a clade containing Boesenbergia aurantiaca and Caulokaempferia 
violacea is strongly supported (BS=100, DI=11). Nevertheless, relationships 
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Fic. 6. The strict consensus tree of the two most parsimonious trees resulting from analysis 


of the combined data set (ITS and trnL-F) for 26 taxa. Upper numbers are bootstrap values 
of 1000 replicates. Lower numbers are decay indices (CI=0.641, RI=0.568, RC =0.364). 


among these genera are not resolved with any significant support in this combined 
analysis. 

Successive weighting searches of the combined data set using the RC index 
produced a single most parsimonious tree (Fig. 7). The same two major clades 
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Fic. 7. The single most parsimonious tree resulting from successive weighting searches of 
the combined data set (ITS and ігпі.-Е) for 26 taxa using a rescaled consistency index. 





can be recognized, the Curcuma clade and the Hedychium clade. In the Curcuma clade, 
Camptandra and Pyrgophyllum are sister to a set of four morphologically very similar 
genera, Curcuma, Paracautleya, Smithatris and Stahlianthus. Smithatris is sister 
to ParacautleyalCurcuma (subgen. Curcuma) and Stahlianthus/Curcuma (subgen. 
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Hitcheniopsis). Within the Hedychium clade, Cautleyal Roscoea is sister to all of the 
rest. Hedychium is next separated as sister to the genera of the ‘Boesenbergia group’: 
Boesenbergia, Caulokaempferia, Cornukaempferia, Distichochlamys M.F. Newman, 
Kaempferia, Scaphochlamys and Zingiber. Boesenbergia is found to be paraphyletic; 
Caulokaempferia forms a clade with Boesenbergia aurantiaca, Distichochlamys is 
sister to Scaphochlamys, and Cornukaempferia and Zingiber are sister to each other. 


DISCUSSION 
Comparison with previous sequence 


The 1151 апа ITS2 sequences of Renealmia battenbergiana and Pleuranthodium 
schlechteri in this study are identical to those of the same taxa obtained by 
Rangsiruji et al. (2000а), and only one nucleotide of ITS2 was found to be different 
in Alpinia galanga. For the trnL-F spacer, the sequences of Alpinia galanga and 
Pleuranthodium schlechteri are identical to the sequences of Rangsiruji et al. (2000а). 
However, the first 37 nucleotides in the spacer of Renealmia battenbergiana of 
Rangsiruji et а!. (2000a) were different from this study. Nine unmatched nucleotides 
and one gap of eight nucleotides were observed in this region. Apart from this, the 
sequences differed by only two nucleotides. Each sequence obtained in this study 
was a complete region of trnL intron and trnL-F spacer from the sequencing of all 
four primers’ products (c, d, e and f). The reason why the sequence of Renealmia 
battenbergiana differs from that observed by Rangsiruji et al. (2000а) is unclear. 
Different PCR conditions amplify different sites of the region. In addition, the prob- 
lematic site is near the beginning of the primer which makes it more difficult to 
obtain the correct sequence using only one primer. 


Evolution of ITS and trnL—F 


The rate of mutation in ITS of the Zingibereae is about nine times faster than in the 
trnL-F region. As a result, phylogenetic relationships revealed by ITS are more fully 
resolved than those revealed by the trnL-F region. This has also been observed in 
Gentiana L., a genus of perennial dicotyledonous herbs (Gielly et al., 1996). 

The ІТ5І, 5.85 and ITS2 sequences of Hedychium were found to be markedly less 
variable than those of other genera in the Zingibereae. Their usefulness as phylo- 
genetic markers is thus minimal, as also observed by Wood et al. (2000). There are 
two possible explanations for this. Firstly it may be attributed to an exceptionally 
low mutation rate of sequences in Hedychium compared to other genera in the 
family. The other explanation is that the diversity of morphology found in the genus 
is large and outstrips the mutation rate of the ITS genes (rapid radiation). The latter 
may relate to the theory that morphology is normally held in equilibrium by stabiliz- 
ing selection for much of evolutionary time but is punctuated by relatively rapid 
speciation events (Bateman, 1999). This phenomenon may also have occurred in 
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Curcuma subgen. Hitcheniopsis where, though ITS sequence variation is low, the 
species are distinct morphologically. Another example can be found in Aframomum 
K. Schum. of the Alpinieae with 70 species (D.J. Harris, pers. comm.) where ITS 
variation is exceptionally low: 0–2.74% (Harris et а!., 2000). The mechanism can be 
further explained by ecological factors. Most Aframomum species are found on the 
boundary between forest and savanna; the ecological constraints of these habitats 
are normally large and have a profound effect on the morphology of the species 
(Harris et al., 2000). The different edges have rather specific conditions and these 
differences could exert a driving force for speciation. It may also be assumed that the 
distribution of an ancestor species was restricted, thus giving rise to a few species by 
peripheral isolation or fragmentation. 

In contrast, the sequence is very variable in Kaempferia which has the highest 
mutation rate of the genera in this study. It is also noticeable that its ITS sequences 
are polymorphic, suggesting that there may be more than one copy of the ribosomal 
gene, or low molecular drive to homogenize the gene. This would allow the presence 
of different copies of the gene and relaxation of the homogenization process, giving 
rise to the very variable ITS sequences found among Kaempferia species. The big 
deviation of the ITS mutation rate in Kaempferia and Scaphochlamys from the mean 
rate in other genera of the Zingibereae poses the potential problem of long branch 
attraction when analysed under a parsimony criterion (Felsenstein, 1978). Nonethe- 
less, no morphologically implausible groupings in the trees are observed. This may 
be due to the fact that the sampling in this study is quite representative. 

The ITS analyses give more accurate trees than trnL—F analyses because they 
contain more taxa and more informative sites in the data matrix. In addition, there 
is no strongly contradictory clade revealed by the analyses of the two genomes. 
Thus, the following discussion is based mainly on trees resulting from the ITS 
analyses, while the results of the trnL—F analyses are used as supporting evidence. 


The tribal position of Pommereschea and Rhynchanthus 


For many years Pommereschea апа Rhynchanthus have been placed in the А/ріпіеае, 
primarily because they lack lateral staminodes (Smith, 1981; Larsen et al., 1998). 
The present analysis confirms the finding of other recent analyses that they belong in 
the Zingibereae (Wood et al., 2000; Kress et al., 2002). 


Caulokaempferia 


Caulokaempferia violacea from northern Thailand is grouped with two Boesenbergia 
species from Borneo in this analysis. Nonetheless, we cannot disregard the pos- 
sibility that Boesenbergia may be polyphyletic and that Caulokaempferia may be 
unplaced, as found by Kress ег а!. (2002). Further sampling of Boesenbergia and 
Caulokaempferia species and studies of other lines of evidence are needed before 
suggesting any reclassification of Caulokaempferia. 
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The Curcuma complex 


All genera currently recognized in the Curcuma complex may be regarded as a 
single genus, Curcuma, though there are some morphological characters supporting 
the separation of each taxon, as seen in Table 3. These characters are however 
autapomorphic, as suggested by the present data. The acceptance of Hitchenia, 
Paracautleya, Smithatris and Stahlianthus as distinct genera leaves Curcuma as a 
paraphyletic genus within which infrageneric relationships are more complicated. 
Smithatris may be regarded as a distinct genus, though further sampling of Curcuma 
species may prove otherwise. 


SUMMARY 


The Zingibereae are found to be monophyletic and most of the clades within the 
tribe agree with those found by Kress et al. (2002), lending strong support to their 
new classification of the Zingiberaceae. This is clearly shown by the placement of 
Paracautleya bhatii which we find to be in the Zingibereae as predicted by Kress 
et al. 

Our results show some divergence from those of Kress et al. with regard to the 
relationships among Boesenbergia species and the placement of Caulokaempferia. 
What is clearly shown here is that Boesenbergia is paraphyletic with respect to 
Caulokaempferia, but sampling is insufficient for firm conclusions to be drawn. 

Two main subclades can be recognized in the tribe Zingibereae, namely the 
Hedychium clade and the Curcuma clade. 

Although few morphological synapomorphies in the tribes are readily observable 
in the field, exceptions occur, and we feel that the best course for the moment is to 
consider combinations of characters when placing genera. The classification of the 
Zingiberaceae seems never to be adequate when based just on a few morphological 
characters. Besides, as more and more molecular data become available for phylo- 
genetic investigation, convergence and reversals of morphological characters turn 
out to have occurred more often than previously thought. 
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